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ABSTRACT 

An RNA-dependent RNA polymerase (RdRp) is the 
central catalytic subunit of the RNA-synthesizing 
machinery of all positive-strand RNA viruses. 
Usually, RdRp domains are readily identifiable by 
comparative sequence analysis, but biochemical 
confirmation and characterization can be ham¬ 
pered by intrinsic protein properties and technical 
complications. It is presumed that replication and 
transcription of the ~30-kb severe acute respiratory 
syndrome (SARS) coronavirus (SARS-CoV) RNA 
genome are catalyzed by an RdRp domain in the 
C-terminal part of nonstructural protein 12 (nsp12), 
one of 16 replicase subunits. However, thus far 
full-length nsp12 has proven refractory to expres¬ 
sion in bacterial systems, which has hindered both 
the biochemical characterization of coronavirus 
RNA synthesis and RdRp-targeted antiviral drug 
design. Here, we describe a combined strategy 
involving bacterial expression of an nsp12 fusion 
protein and its in vivo cleavage to generate and 
purify stable SARS-CoV nsp12 (106 kDa) with a 
natural N-terminus and C-terminal hexahistidine 
tag. This recombinant protein possesses robust 
in vitro RdRp activity, as well as a significant DNA- 
dependent activity that may facilitate future inhibitor 
studies. The SARS-CoV nsp12 is primer dependent 
on both homo- and heteropolymeric templates, 
supporting the likeliness of a close enzymatic col¬ 
laboration with the intriguing RNA primase activity 
that was recently proposed for coronavirus nsp8. 


INTRODUCTION 

The central enzymatic activity in the life cycle of 
positive-strand RNA (+ RNA) viruses is the process of 


RNA-templated RNA synthesis. In mammalian cells, the 
replication of + RNA genomes always occurs in the cyto¬ 
plasm where the viral RNA-synthesizing machinery is 
commonly associated with (modified) cytoplasmic 
membranes, which presumably provide physical support 
and may serve to protect the complex from host defense 
mechanisms (1,2). The viral enzyme complex contains an 
RNA-dependent RNA polymerase (RdRp) as its core 
catalytic subunit for the production of negative strand 
RNA (-RNA), new genome molecules and in many 
virus groups also subgenomic (sg) messenger RNAs 
(mRNAs) (1,3). 

Coronaviruses (CoVs) are a family of +RNA viruses 
that infect a wide range of vertebrates including humans, 
as exemplified by severe acute respiratory syndrome 
(SARS) and various respiratory infections caused by 
less-virulent family members (4). The group is also 
known for its exceptionally large polycistronic genome 
of ~30kb, which is 5' capped and 3' polyadenylated 
RNA. The 5'-proximal two-thirds of the CoV genome 
(open reading frames la and lb) are translated into 
the viral replicase polyproteins ppla and pplab. The 
3'-proximal third encodes the viral structural proteins 
and several so-called accessory proteins, which can only 
be expressed following the production of a set of four to 
nine sg mRNAs (5,6). In particular, it is this CoV 
hallmark that has attracted attention over the years, 
since these sg mRNAs consist of sequences that are 
noncontiguous in the viral genome: a common 5' leader 
sequence is attached to body segments representing a 
variable part of the 3'-proximal genomic region. 
According to the now widely supported transcription 
model of Sawicki and Sawicki (7), the arterivirus and 
CoV sg RNAs derive from a discontinuous step during 
minus-strand RNA synthesis that is guided by specific 
RNA signals and resembles copy-choice RNA recombina¬ 
tion (5,6,8). This mechanism produces a set of subgenome- 
length negative-strand templates from which the sg 
mRNAs are produced. The unusual genome size of 
CoVs and the complexity of their RNA synthesis hint at 
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an RdRp that may have special properties, either by itself 
or in conjunction with other subunits of the CoV replica¬ 
tion/transcription complex (RTC). 

CoV replication and transcription are driven by the 15 
or 16 viral nonstructural proteins (nsps) encoded in the 
replicase gene, which include many enzyme activities that 
are rare or lacking in other families of +RNA viruses 
(9,10). These nsps are produced during co- and post- 
translational processing of the ppla and pplab replicase 
polyproteins by two or three internal viral proteinases 
(11). In the years following the 2003 SARS outbreak, 
bioinformatics, structural biology, (reverse) genetics and 
biochemical studies have all contributed to the in-depth 
characterization of CoV nsps in general and those of 
SARS-CoV in particular (9,10,12). Currently documented 
enzyme activities include proteinases (11), a putative RNA 
primase (13), a superfamily 1 helicase (14), an exo- and an 
endoribonuclease (15,16), single-stranded RNA (ssRNA)- 
binding proteins (17,18) and two methyltransferases 
(19,20). Following the sequence analysis of the first CoV 
genomes, more than two decades ago, the putative RdRp 
domain of the coronavirus RTC had already been 
identified by comparative sequence analysis (21,22). The 
C-terminal, two-thirds of the 932-amino-acid subunit, 
now known as nspl2 was confidently aligned with the 
conserved motifs of well-known RdRps (22,23). In view 
of its unprecedented size among viral RdRp subunits, 
which commonly consist of 500-600 amino acids, nspl2 
may well harbor other functional domains in its as-yet 
uncharacterized N-terminal domain. Biochemical infor¬ 
mation on the CoV RdRp has, however, remained 
scarce thus far, particularly since full-length nspl2 was 
refractory to expression in bacterial systems. Preliminary 
evidence for the in vitro RdRp activity of nspl2 came from 
a study of the SARS-CoV enzyme by Cheng el al. (24). 
However, their experiments, employing a glutathione 
S-transferase (GST)-nspl2 fusion protein, were ham¬ 
pered by protein instability, which resulted in the fragmen¬ 
tation of the protein into three parts. Primer-dependent 
activity on poly(A) templates was observed in filter 
binding assays, but the enzyme biochemistry was not 
further characterized in detail. 

Given its pivotal role in viral replication and the efficacy 
of polymerase inhibitors used to combat other virus 
infections, the SARS-CoV RdRp is widely regarded as 
an important and attractive target for the rational 
design of anti-CoV drugs (23,25). We therefore sought 
to solve the technical issues described above and develop 
reliable protocols for stable expression, purification and 
in vitro activity of the full-length CoV nspl2. In this 
article, we describe an expression strategy utilizing in 
vivo cleavage in a bacterial expression system by ubiquitin 
carboxyl-terminal hydrolase 1 (Ubpl), which releases the 
small N-terminal fusion partner ubiquitin (Ub) and 
generates a recombinant nspl2 protein containing its 
natural N-terminus. This was found to be a critical step 
on the road toward purification of a stable recombinant 
SARS-CoV RdRp. We subsequently established that full- 
length nspl2 can readily synthesize RNA in a primer- 
dependent fashion on both homo- and heteropolymeric 
RNA templates and is also efficient at incorporating 


deoxy-nucleotides on a DNA template. This work there¬ 
fore provides a solid basis for the further biochemical 
characterization of the coronavirus nspl2 and the 
detailed dissection of its role in viral replication and tran¬ 
scription. Understanding the enzyme’s properties will be 
essential for the development of selective coronavirus 
RdRp inhibitors and will likely provide novel insights 
into the intriguing (evolutionary) relation between the 
RdRp of nidoviruses and other groups of + RNA viruses. 

MATERIALS AND METHODS 

Cloning and expression of recombinant SARS-CoV nspl2 

The amino acid sequence of SARS-CoV nspl2, residues 
S4370 to Q5301 of replicase pplab, is fully conserved 
among all sequenced SARS-CoV isolates. For our study, 
the nsp 12-coding region was amplified by reverse 
transcription-polymerase chain reaction (RT-PCR) from 
the genome of SARS-CoV Frankfurt-1 (AY291315). 
During amplification, the ORFla/lb —1 ribosomal frame- 
shift near the 5'-end of the gene was corrected in a 
translationally silent manner. Primers used for amplifica¬ 
tion were SAV424 5'-GCGGGTACCCCGCGGTGGAT 
CTGCGGATGCATCAA-3' and SAV425 5'-GCGCGAT 
CG GGATCC CTGCAAGACTGTATGT-3', correspond¬ 
ing to SARS-CoV genome positions 13 327-13 387 and 
16 152-16 166, respectively. The PCR amplicon was 
digested with SacII and BamHI, and ligated into expres¬ 
sion vector pET26-Ub-CHis 6 (26). Following confirma¬ 
tion of the nucleotide sequence of the amplicon and 
using the flanking Xbal and Xhol restriction sites, the 
ubiquitin-nspl2-His 6 fusion gene (Ub-nspl2-CHis 6 ) was 
subcloned into vector pASK3 (IBA) and thereby placed 
under the control of a tetracycline-inducible promoter. 
This ruled out the known risk of potential contamination 
with T7 RNA polymerase, as it can occur when using a T7 
promoter to drive expression of other RdRps in bacterial 
expression systems. Nspl2 active-site mutant D618A, 
carrying an Asp618Ala substitution, was engineered via 
site-directed mutagenesis according to the QuikChange 
protocol (Stratagene), using primers SAV526 5'-CCTTA 
T GGGTT GGGCTT AT CCA A A AT GT G-3' and SAV527 
5'-CACATTTTGGATA AGC CCAACCCATAAGGA-3'. 

For nspl2 expression, the pASK3-Ub-nspl2-CHis 6 
plasmid was transformed to Escherichia coli C2523 cells 
(New England Biolabs) together with the Ubpl protease 
expression plasmid pCGI (26), unless indicated otherwise 
for particular experiments. Routinely, 250 ml of Luria 
Broth, containing ampicillin (50 pg/ml) and chloram¬ 
phenicol (34pg/ml), was inoculated 1:1000 with 3 ml o/n 
precultures and cells were grown at 37°C to OD 60 o >0.7. 
Subsequently, the cells were slowly cooled to room tem¬ 
perature, tetracycline was added to a final concentration 
of 200 pg/ml and the cells were further grown at 20° C for 
another 16 h. Cells were harvested by centrifugation, 
washed once with ice-cold phosphate-buffered saline 
(PBS) and stored at —20°C until protein purification. 

SARS-CoV nspl2 expression vectors pKM596-nspl2 
(kindly provided by Dr. Isabelle Imbert and Dr. Bruno 
Canard) and pET102-Thio-nspl2 (a kind gift of 
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Dr. Susanne Pfefferle and Dr. Christian Drosten) were 
used to transform E. coli BL21(DE3) cells and expression 
was performed using the auto-induction medium 
ZYM-5052 (27). 

Purification of SARS-CoY nspl2-His 

Cell pellets were thawed on ice, resuspended in binding 
buffer (20 mM HEPES pH 7.4, 500 mM NaCl, 20 mM 
imidazole, 10% glycerol, 0.05% Tween-20 and 5mM 
p-mercaptoethanol) and lysed by sonication. Cell debris 
was removed by ultra-centrifugation at 20 000 g and the 
cleared supernatant was incubated with Talon beads 
(Clontech) for 2h at 4°C. The beads were washed three 
times with 20 vol of binding buffer, loaded onto spin 
columns and finally the His-tagged protein was eluted 
with 300 mM imidazole in binding buffer. The eluate 
was analyzed by sodium dodecyl sulfate polyacrylamide 
gel electrophoresis (SDS PAGE) and typically found 
to be 80-85% pure. Subsequently, gel filtration on 
a Superdex 200 column (GE Healthcare) in 20 mM 
HEPES, 500 mM NaCl, 5% glycerol, 5mM p-mercapto- 
ethanol and 0.1% Tween-20 was used to improve purity 
to ~95%. To estimate the molecular mass of nspl2, a 
standard curve for hydrodynamic elution was made 
using chymotrypsinagen A, conalbumin, aldolase and 
dextran blue (all purchased from GE Healthcare) as size 
markers and fitted to a linear equation (R 2 = 0.99). 

After the gel filtration step, the nspl2-containing frac¬ 
tions were pooled, concentrated with Ultrafree 10-kDa 
filter columns (Millipore) and dialyzed overnight against 
1000 sample volumes of dialysis buffer (20 mM HEPES 
pH 7.4, 100 mM NaCl, 0.1% Triton-X 100, 1 mM DTT 
and 50% glycerol) in cellulose tubing (SnakeSkin, Pierce). 
Protein concentrations were determined using a Bradford 
assay and a bovine serum albumin (BSA) standard curve, 
and the purified protein was stored at —20°C. When using 
expression vectors pKM596-nspl2 and pET102-Thio- 
nspl2, protein purification procedures were essentially 
similar to those described above, except that amylose 
resin (New England Biolabs) was used to bind the 
pMK596-nspl2 product, which was eluted using a buffer 
containing lOmM amylose instead of imidazole. 

SDS-PAGE and western blotting 

Cells were lysed in Laemmli’s sample buffer (28), 
separated on 8% SDS-PAGE gels and either electro- 
blotted to polyvinylidene fluoride (PVDF) membrane 
(GE Healthcare) or stained directly with Coomassie 
G-250 according to standard protocols. Prior to incuba¬ 
tion with a SARS-CoV nsp 12-specific rabbit antiserum (a 
kind gift from Dr. Mark Denison) (29), blot membranes 
were blocked with 10% skimmed milk and 0.02% Tween- 
20 in PBS. Blots were washed with 0.02% Tween-20 
in PBS, incubated with a swine-anti-rabbit horseradish 
peroxidase conjugate diluted in 5% skimmed milk, 
0.02% Tween-20 and PBS, washed and subjected to 
ECL Plus chemiluminescence detection (GE Healthcare). 


Preparation of templates 

RNA oligonucleotides and their DNA counterparts 
SAV557R 5' -GCUAUGUGAGAUUAAGUUAU-3' 
and SAV481R 5'-UUUUUUUUUUAUAACUUAAUC 
UCACAUAGC-3', SAV555R 5'- UUUUUUUUUUU 
UUUUUUUUUAUAACUUAAUCUCACAUAGC-3' 
and SAV556R 5'-UCUCUCUCUCUCUCUCUCUCAU 
AACUUAAUCUCACAUAGC-3' were purchased from 
Eurogentec, excised as a single band from 7M urea/15% 
polyacrylamide gels, eluted overnight in deionized water 
and desalted using NAP-10 columns (GE Healthcare). 
To anneal RNA duplexes, oligonucleotide mixtures in 
annealing buffer [20 mM Tris-HCl pH 8.0, 50 mM NaCl 
and 5mM ethylenediaminetetraacetic acid (EDTA)] were 
heat denatured, slowly cooled to room temperature and 
subsequently purified from 15% nondenaturing poly¬ 
acrylamide gels. 

RdRp activity assays 

To test the in vitro RdRp activity of SARS-CoV nspl2- 
CHis 6 , 0.1 pM of the protein (unless otherwise indicated) 
was incubated at 30°C in the presence of 1 pM template, 
0.17 pM [a- 32 P]ATP (0.5 pCi/pl; Perkin-Elmer), 50 pM 
ATP (Roche), 20 mM Tris-HCl (pH 8.0), 10 mM 
NaCl, 10 mM KC1, 6mM MgCL, 5% glycerol, 0.01% 
Triton-X 100, ImM DTT and 0.5 U of RNaseOUT 
(Invitrogen). Glycerol and NaCl were introduced with 
the addition of enzyme due to their presence in the 
nsp 12 storage buffer. RdRp reactions were terminated 
by adding an equal volume of 90% formamide, 50mM 
EDTA pH 8.0, 0.01% bromophenol blue and xylene 
cyanol. Reaction products were analyzed on 7 M Urea, 
20% polyacrylamide gels buffered with lx TBE (90mM 
Tris, 90 mM H 3 B0 3 and 2mM EDTA). Gels were run 
for 5h at 2000 V and analyzed using phosphorimaging. 
Plates were scanned on a Typhoon variable mode 
scanner (GE Healthcare) and products were quantified 
with ImageQuant TL 7.0 (GE Healthcare). To calculate 
the amount of [a- 32 P]AMP incorporated from the 
phosphorimager signal, a 10 -2 -10 -3 dilution series of 
the [a- 32 P]ATP stock was spotted in triplicate on 
Whatman filter and exposed alongside the PAGE gel of 
interest. Using the standard curve, phosphorimager 
counts were converted into the amount of label incorpo¬ 
ration and kinetic traces were fit to a linear function 
with the MATLAB 2008b Curve Fitting Toolbox 
(Mathworks). 

Electrophoretic mobility shift assays 

A dilution series of SARS-CoV RdRp in storage buffer 
was incubated for 10 min at room temperature with 
0.2 nM of 32 P-labeled oligonucleotide SAV557R (see 
above), in the presence or absence of the complementary 
oligonucleotide SAV481R. Samples were then directly 
loaded on to 8% acrylamide gels containing 5% glycerol 
and 0.5x TBE and were run at 100 V for 2h at 4°C. Gels 
were dried on Whatman filter paper and bands were 
quantified by phosphorimaging. Using the MATLAB 
2008b Curve Fitting Toolbox, the percentage of RNA 
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bound was fit to the Hill equation RNAb OU nd = 
b*[nspl2]"/(A'" 1 + [nspl2] n ), where b is the upper binding 
limit, [nspl2] is the nspl2 concentration, n is the Hill coef¬ 
ficient and K d is the enzyme’s dissociation constant. 

RESULTS 

Stable expression and purification of full-length 
SARS-CoV nspl2 

A PCR amplicon encoding SARS-CoV nspl2 was first 
cloned into T7 expression vector pET26-Ub-CHis 6 , 
which provided the 932-aa viral protein with an 
N-terminal Ub tag (76 amino acids) and a C-terminal 
hexahistidine tag (Figure 1A). To avoid the known risk 
of T7 RNA polymerase contamination when purifying 
another RdRp using a T7 promoter-driven expression 
system [like E. coli BL21(DE3)], the Ub-nspl2-CHis 6 
fusion gene was subsequently transferred to plasmid 
pASK3, placing it under control of a tetracycline- 
inducible promoter. Initially, the protease that was later 
used to remove the Ub tag was not co-expressed and the 


expression of the full-length Ub-tagged nspl2-CHis 6 was 
compared to that of variants containing either maltose 
binding protein (MBP) or thioredoxin (Thio) as fusion 
partner. As shown in Figure IB and as also documented 
in the previous work of Cheng et al. (24), expression 
and purification of nspl2 fusion proteins resulted in 
the abundant and progressive fragmentation of the 
protein. Based on their sizes in SDS-PAGE, the 
cleavage products likely were the 64- and 39-kDa nspl2 
degradation products as found by Cheng et al. (24) 
when working with their GST-nspl2 fusion construct 
(Figure IB). Therefore, our results confirmed and 
extended the previously reported observations regarding 
the instability of SARS-CoV nspl2 when expressed in 
E. coli as part of a fusion protein. 

In studies with the poliovirus (PV) RdRp (3D po1 ), it was 
previously shown that N-terminal foreign sequences, even 
the addition of a single methionine residue, can have a 
significant impact on enzyme activity (26,30). Structural 
studies subsequently revealed that a native N-terminus is 
required for the proper folding of the RdRp’s ‘fingers’ 
subdomain and the positioning of the 3D po1 active 
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Figure 1 . Expression and purification of the SARS-CoV nspl2. (A) Expression of nspl2 was performed in the presence of the ubiquitin (Ub) 
protease Ubpl, which removed the N-terminal Ub fusion partner by cleaving after the LRGG signature that formed the junction between the Ub 
and nspl2 moieties. This in vivo cleavage created the native nspl2 amino-terminus with the sequence SADAS. (B) Western blotting using an nspl2- 
specific polyclonal antiserum identified a single band of the expected mass after purification of the in vivo cleaved expression product. Isolation of 
fusion proteins with different N-terminal tags resulted in common patterns of degradation. Similar results were reported by Cheng et al. (24) for a 
GST-nspl2 fusion protein. (C) Purified nspl2-CHis 6 was readily visualized by Coomassie staining as a ~106-kDa protein product, in line with the 
expected mass. Inductions in the absence of Ubpl expression, however, resulted in a mixture of full-length fusion protein and degradation products, 
likely as a result of improper folding. No RdRp activity was observed for these preparations. (D) Gel filtration analysis of purified nspl2 in assay 
buffer showed a single peak, corresponding to a globular molecular weight of 140 ± lOkDa, suggesting that only the monomeric form of nspl2 is 
present under the conditions used. 
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site (30). These findings suggested to us that N-terminal 
additions might also hamper the proper folding of the 
SARS-CoV RdRp and could thus be the principal cause 
of its subsequent degradation. We therefore co-expressed 
the C-terminal Ub protease Ubpl (26), which specifically 
recognizes the cleavage site in our fusion protein that 
is located between the Ub moiety and the N-terminus of 
nspl2 (LRGG\ SADAS). Proteolytic cleavage of the 
fusion protein at this site liberated an nspl2-CHis 6 
product (Figure 1A), now containing its natural N- 
terminus. The expression of the Ubpl protease was 
driven by the relatively weak constitutive ADFU 
promoter (26). Flence, to ensure optimal release of 
nspl2-CFLis 6 by the protease, induction of Ub-nspl2- 
CHis 6 expression was performed only in the middle or 
late log-phase of bacterial growth. Induction at lower 
densities or on-column cleavages performed with the 
cleavable MBP and Thio fusion proteins resulted in sig¬ 
nificant amounts of degraded nspl2, which was inactive in 
RdRp assays (data not shown). 

As depicted in Figure IB and C, the in vivo cleavage 
product that we were able to routinely purify was of the 
expected 106-kDa size (9) and was found to be stable, even 
after multiple rounds of purification and prolonged 
storage at — 20°C. Non-denaturing gel filtration yielded a 
single peak corresponding to a size of about 140 ± lOkDa 
as estimated by comparison to a standard curve (Figure 
1E-F). This is different from the calculated mass of 
106 kDa, but may be explained by, e.g. a very open or 
nonglobular structure of the protein. Moreover, unex¬ 
pected behavior in gel filtration is not without precedent 
for RdRps, as was, e.g. reported for the monomeric cp6 
RdRp P2 (31). Future structural studies involving electron 
microscopy or crystallography will hopefully allow us to 
address this question in more detail. 

SARS-CoV nspl2 is a primer-dependent RdRp 

+ RNA virus replication may proceed through a variety 
of scenarios, but essentially viral RdRps employ only two 
fundamentally different mechanisms during the first step 
of RNA synthesis: initiation can occur either de novo, 
when the 3' hydroxyl group of the first nucleotide essen¬ 
tially is the ‘primer’ to which a second nucleotide is added, 
or initiation can depend on an oligonucleotide or protein 
primer (32). Imbert et al. (13) recently showed that the 
CoV nsp8 subunit, a relatively small protein (22 kDa) 
that forms a hexadecameric complex with nsp7 (33), can 
synthesize short oligonucleotides using poly(C) templates 
or templates containing a 3' CCG/U motif. Although the 
exact role of nsp8 remains to be studied in much more 
detail, one of its proposed functions is that of an ‘RNA 
primase’ (13). According to this hypothesis, and similar to 
what has been documented for cellular DNA polymerases 
(34), the oligonucleotide products of nsp8 activity would 
serve as primers for the viral ‘main RdRp’, i.e. nspl2. 
Some other observations were also consistent with the 
possibility that the SARS-CoV main RdRp belongs to 
the class of primer-dependent polymerases. First, 
modeling of the structure of SARS-CoV nspl2 residues 
400-900 suggested that the central domain of the protein 
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Figure 2. SARS-CoV nspl2 primer extension assay and active-site 
mutant. (A) Schematic showing the structure of the partially double- 
stranded RNA template with 3' U 10 stretch that served as template 
for primer extension in the initial nspl2-based nucleotide incorporation 
assay. (B) Comparison of the RdRp activity of wild-type nspl2 and 
a D618A active-site mutant, which displayed minimal activity after 
a 60-min incubation at 30°C. (C) A primer extension assay using a 5' 
32 P-labeled primer and ATP confirmed that wild-type nspl2, in contrast 
to the D618A mutant, was able to elongate the 20-rner primer. The 
slight reduction of input primer in the D618A lane probably resulted 
from degradation by RNase activity. The D618A mutant showed 
8 ± 3% residual activity (mean of three independent experiments). 
In the loading control (Figure 2B and C, lower panels) the two 
nsp 12 variants were visualized by silver staining of an SDS-PAGE gel. 


contains a G motif, a signature present in all primer- 
dependent RdRps (23). Second, when using the unstable 
GST-nspl2 fusion protein in combination with a poly(A) 
template and a poly(U) primer, Cheng et al. (24) observed 
a moderate RdRp activity in filter-binding assays. 
Convincing evidence for the extension of an RNA 
primer in vitro, however, has not been documented 
thus far. 

To qualitatively assess the polymerase activity of the 
nsp 12, we monitored the incorporation of nucleotides on 
a well-defined synthetic RNA template, which was pre¬ 
dicted to lack secondary structures that could affect the 
enzyme’s processivity. This template (Figure 2A) consisted 
of a 20-nt duplex and a 3' overhang of 10 uridylates, which 
could serve as template during extension of the 20-mer 
primer. As shown in Figure 2, primer extension was 
revealed by both the incorporation of [a- 32 P]ATP 
(Figure 2B) and the extension of the 32 P-labeled primer 
(Figure 2C). The use of single-stranded or homopolymeric 
[poly(U) or poly(C)] templates did not lead to significant 
incorporation of nucleotides (data not shown). 

Based on sequence alignments of SARS-CoV nspl2 
with RdRps of known structure, it was previously 
concluded that the canonical aspartic acid residues 
responsible for the coordination of the divalent cation in 
the RdRp’s active site should be nspl2 residues 618, 760 
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and 761, the latter two amino acids being part of the 
nidovirus-specific SDD motif (23,35-37). In other viral 
RdRps, single or multiple substitutions of the equivalent 
residues with alanine or asparagine were shown to change 
or even abolish polymerase activity (38,39), depending 
on the assay conditions used. For the PV 3D po1 , for 
example, it was found that mutagenesis of the motif C 
core sequence, changing it from GDD to GDN, was 
detrimental in the presence of Mg 2 + , but not when 
Mn 2+ or Fe 2+ were provided as divalent cation (38). To 
obtain an active site mutant that could be used as negative 
control during further development of our assays, we engi¬ 
neered nspl2 mutant D618A. As portrayed in Figure 2B 
and C, the mutant protein displayed only 8 ± 3% residual 
primer extension activity in our assays (Figure 2B-D), 
thus confirming the nsp 12-specific nature of the RdRp 
activity described here and supporting the prior hypothe¬ 
sis that this residue is a key player in the enzyme’s active 
site. Further or complete knockdown of activity may 
be obtained under slightly different assay conditions 
or in combination with substitution of other active site 
residues, which remain to be tested in future studies. 

Biochemical determinants of the in vitro activity of the 
SARS-CoV nspl2 

It is common practice to employ filter binding assays 
during optimization of polymerase activity assays. 
Although convenient and less time consuming than gel 
analysis of reaction products, these assays merely reveal 
incorporation of label, but do not provide direct informa¬ 
tion on the length and integrity of the products. Therefore, 
when exploring the effects of different reaction buffer 
compositions, we consistently quantified and analyzed 
[a- 32 P]ATP incorporation using denaturing 20% 
acrylamide gels. The standard assay buffer consisted of 
20 mM Tris-HCl (pH 8.0), 10 mM KC1, ImM DTT, 
6mM MgCl 2 , 5% glycerol, 0.1% Triton-X 100 and 
50 pM ATP. 

First, the influence of reaction temperature on RdRp 
activity was tested within the range of 20-37°C. Activity 
at 37°C was 40% higher than at 30°C, which is more 
commonly used in RdRp assays (Figure 3A) (37,4(M12). 
However, a more detailed analysis revealed that this 
increase largely stemmed from enhanced initiation of 
primer extension rather than elongation, with the latter 
being decreased by 35% as judged from the amount of 
full-length product (primer + 10, Figure 3B). Therefore, 
all subsequent experiments were performed at 30°C. 
Contrary to expectations however, the major extension 
product never matched the length of the primer plus 
10nt, suggesting that nspl2 generally terminates before 
reaching the end of the primed poly(U) template. The 
results obtained with other templates are discussed 
subsequently. 

It is well established that nucleic acid polymerases 
utilize divalent cations, mostly Mg 2 + , as cofactors to 
bind and coordinate the incoming nucleotide during the 
polymerization reaction (35,36). As shown in Figure 3C 
and D, the activity of nsp 12 was positively correlated with 
the Mg 2+ concentration and reached its optimum at 


6mM. Higher concentrations (>20mM) ultimately abol¬ 
ished activity (data not shown). The SARS-CoV RdRp 
does not require Mn 2+ for RNA synthesis, as previously 
reported for the nsp9-RdRp of the arterivirus equine 
arteritis virus (EAV), a distant relative in the order 
Nidovirales (37). In fact, the presence of Mn 2+ in an 
RdRp assay utilizing a poly(U) template decreased the 
fidelity of the SARS-CoV nsp 12 relative to Mg 2+ and 
facilitated both transversional and transitional misincor- 
porations (Figure 4). The specificity for the nucleotide’s 
sugar moiety seemed not affected, however, since 
deoxyadenosine triphosphate (dATP) was not incorpo¬ 
rated when using an RNA template in the presence 
of Mn 2+ . This discriminates the SARS-CoV RdRp 
from, e.g. its PV equivalent, for which correct base 
pairing was more important than the presence of a 2' 
hydroxyl group (40). 

To establish the importance of pH, we tested SARS- 
CoV RdRp activity in the pH range of 6.0-8.5. We 
clearly observed an increase of nsp 12 overall activity 
with an optimum at pH 8.0 (Figure 3E), but differences 
between pH 7.0 and 8.5 were marginal in terms of both 
overall activity and amount of full-length product 
produced (Figure 3F). Interestingly, pH 7.5 consistently 
resulted in a significantly larger amount of initiation 
product, suggesting that higher pH values stimulate the 
processivity of the enzyme. 

In additional assays, we tested the effect of monovalent 
cations, bovine serum albumin and spermidine (data not 
shown). No negative influence of potassium or sodium 
ions was observed below a concentration of 50 mM. 
Neutral to slightly negative effects were found for 
both BSA (tested for 0-0.1 mg/ml), which is known to 
stabilize enzymes and reduce potential side effects such 
as absorbance to the plastic surface of the reaction tube 
used. Finally, addition of spermidine, which binds to the 
phosphate backbone of nucleic acids and is a known stim¬ 
ulator of various polymerases, did not improve activity 
in the tested concentration range up to 5mM. 

RNA-binding properties of SARS-CoV nspl2 

Our combined observations suggested that nsp 12 is only 
active in a primer-dependent fashion and may thus prefer 
binding to double-stranded RNA (dsRNA). To determine 
the affinity of nsp 12 for dsRNA and ssRNA we performed 
electrophoretic mobility shift assays, in which we titrated 
purified nspl2 in the presence of 0.2nM 5'- 32 P-labeled 
RNA (Figure 5). Subsequently, the percentage of bound 
template and the Hill equation (see ‘Materials and 
Methods’ section) were used to derive the enzyme’s disso¬ 
ciation constants (X d ). For dsRNA the K d was calculated 
at 0.13 ± 0.03 pM, whereas the K d for ssRNA was 
estimated to be 0.10 ± 0.02 pM. 

Nspl2 nucleotide incorporation rates 

For various viral RdRps, in vitro studies have defined bio¬ 
chemical properties that could not be determined or were 
largely obscured in in vivo assays. Furthermore, following 
the basic characterization of individual enzymes, the inter¬ 
play with templates, cofactors and other subunits of the 
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Figure 3. Biochemical determinants of SARS-CoV nspl2 activity. (A and B) Tests to determine the influence of temperature indicated that nspl2 
incorporates ATP most efficiently at 37°C, although this effect mainly stemmed from increased initiation. Based on these results, 30°C was taken as 
the standard for subsequent experiments. (C and D) Titration of the Mg 2+ concentration showed that activity reaches its maximum at 6mM. (E and 
F) The effect of pH on RdRp activity was evident as well, with lower pHs limiting the activity of the enzyme. All reactions were incubated for 60 min 
at 30°C, unless otherwise indicated. Error bars in A, C and E represent standard error of the mean (n = 3). 


viral enzyme complex can be addressed. Ultimately, this 
should facilitate the in vitro reconstitution of the holo- 
enzyme complex and the analysis of its interactions at 
each step of viral replication and transcription. After the 
purification of active SARS-CoV nspl2 and exploration 
of the reaction conditions, our subsequent step toward 
such a future goal was to evaluate the rate of nucleotide 
incorporation by the SARS-CoV RdRp. On a primed Ui 0 
template we found relatively poor incorporation rates 


(1.2 ± 0.4nM/min) and a major product that was 2-3 nt 
shorter than the expected full-length product (Figure 6A), 
with product sizes displaying a similar Gaussian-like dis¬ 
tribution as depicted in the graphs in Figure 3. On the 
other hand, we also observed minor products that were 
slightly longer than full length, suggesting slippage of the 
polymerase at the end of the template. We did not detect 
any terminal transferase activity on the templates used 
(data not shown). 
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Figure 4. Nucleotide incorporation fidelity of nspl2. (A) Experimental 
set-up of pulse-chase experiments with different nucleotides and a 
primed poly(U) template (see Figure 2A). The reactions were initiated 
with a limiting concentration of [ot- 32 P]ATP to allow the incorporation 
of a first nucleotide and the formation of a stable polymerase-template 
complex. Subsequently, after 10 min, different unlabeled nucleotides 
were added to a final concentration of 50 pM to allow elongation 
for another 30 min. (B) SARS-CoV nspl2 allows only limited trans- 
versional and transitional misincorporations. Interestingly, also in the 
presence of dATP no significant activity was observed, implying that 
the SARS-CoV RdRp is capable of discriminating between ATP and 
dATP. (C) Pulse-chase experiments in the presence of 6mM Mn 2 + 
show that manganese ions promote misincorporation of ribonucleotides 
(both transversions and transitions). The selection against dATP 
remained unaltered. 


Previously, Cheng et al. (24) reported that their 
unstable GST-nspl2 fusion protein had limited reverse 
transcriptase activity. In contrast to these prior results, 
our nspl2 was not able to efficiently use dATP to extend 
an RNA primer (Figure 4A and B) and a similar result 
was obtained with a DNA primer (data not shown). 
To test whether nspl2 was able to incorporate [a- 32 P] 
deoxyadenosine monophosphate (dAMP) when using 
DNA templates, we employed a primed T 10 template 
similar to the RNA templates described above. 



A 
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(nM) 0 0.5 5 50 125 250 




[RdRp](|iM) 


Figure 5. RNA binding affinity of nspl2. (A) A fixed concentration 
(0.2 nM) of radiolabeled dsRNA or ssRNA (data not shown) was 
titrated with purified nspl2 and the complexes formed were separated 
from unbound RNA on a native 8% polyacrylamide gel. (B) Free and 
bound RNA were quantified and fit to the Hill equation (see ‘Materials 
and Methods’ section), resulting in K d values of 0.13 ± 0.3 pM for 
dsRNA and 0.10 ±0.2 pM for ssRNA binding. Rr values of these 
Hill fits were 0.97 and 0.98 for dsRNA and ssRNA, respectively. 
Error bars represent standard error of the mean (n = 3). 


As shown in Figure 6B, nspl2 was capable of incorporat¬ 
ing dAMP when incubated with a DNA template. 
On average, a minor reduction (~20%) in dAMP incor¬ 
poration was observed in comparison to the activity on an 
RNA template. The size distribution of products was 
more different, and showed significantly less initiation 
than on an RNA template (compare Figure 6A and B). 

The incorporation rate of nspl2 increased dramatically 
when we doubled the template length (Fu 2 o = 27 ± 3 nM/ 
min, Figures 6C and 7), but we again observed a major 
product that was 2 nt shorter than the expected full-length 
product. When the primed poly(U) template was changed 
to a primed heteromeric (CU)io, nspl2 activity increased 
further (F(cu)io = 45 ± 2nM/min) and reached a level 
comparable to that obtained for other viral RdRps on 
similar short templates (40). When the products of the 
latter reactions were resolved by denaturing PAGE 
(Figure 6D) they were found to be longer than template 
length. Similar products extending beyond unit length 
have been often observed when short templates were 
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Figure 6. Analysis of nucleotide incorporation by nspl2. (A) Reactions with 0.1 pM nspl2, 1 pM template and 50 pM ATP were followed over time 
and quenched with 50 mM EDTA at the time points indicated. Gel analysis showed that, similar to the curves in Figure 3, the major accumulating 
product is the primer (p) extended by 7nt (i.e. p + 7) and not the expected full-length product of n + 10. (B) To assess whether nspl2 was capable of 
DNA-templated incorporation of dATP, reactions were performed as in Figure 6A, but now using a DNA template. The time course illustrates 
robust [a- 32 P]dAMP incorporation, implying that the SARS-CoV RdRp is able to utilize both RNA and DNA templates. The incorporation of 
dAMP was 20% lower than that of AMP on an RNA template. (C) Time courses were additionally performed using an RNA template containing a 
U 20 template region. Similar to the shorter 10-nt template used in Figure 6A, the major product here was around 2nt shorter than the expected full- 
length product. (D) Changing the single-stranded template from U 20 to a heteromeric (CU)io not only increased the nucleotide incorporation rate by 
34%, but also the behavior of nspl2 on the template. Products of the expected full-length size (p + 20) were observed, but also products longer than 
the template, hinting at, e.g. template switching by nspl2. Future research will address this observation in more detail. In this reaction both 50 pM 
ATP and GTP were present. The gel image represents the top half of a 20% denaturing PAGE gel. 


used for PV 3D po1 (43) and their generation can be 
explained by various mechanisms, such as distributive 
slippage, processive slippage, template switching or 
terminal transferase activity (43 and references therein). 
Future research is expected to address these mechanisms 
in more detail. 


DISCUSSION 

Detailed kinetic and stoichiometric information plays an 
important role in explaining the molecular basis of RdRp- 
catalyzed ribonucleotide incorporation, the associated 
mutation rate of viral genomes and, consequently, RNA 
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Figure 7. Analysis of the nucleotide incorporation rate of nspl2. 
Steady-state time courses were performed with 0.1 pM nspl2 as described 
in Figure 6. Experimental data were subsequently fit to linear regression to 
obtain NMP incorporation rates. These clearly illustrated an increase of 
the incorporation velocity with template length (K ul0 = 1.2 ± 0.4nM/ 
min and Uj 2 o = 27 ± 3nM/min) and from homopolymeric template 
to copolymeric template (K(cu)io = 45 ± 2nM/min). R 2 values were 
0.97, 0.99 and 0.99, respectively, and error bars indicate standard error 
of the mean (n = 3). 

virus evolution. Furthermore, a comprehensive under¬ 
standing of the differences between viral RdRps and 
cellular polymerases is crucial in the design of specific 
inhibitors of viral RNA synthesis and their subsequent 
mechanistic characterization and optimization. Up till 
now, all of the above information was particularly 
limited, if not completely lacking, for the CoV RdRp, 
due to enzyme insolubility, instability and low purification 
yields. 

This study reports the first successful expression and 
purification of stable, soluble SARS-CoV nspl2, thus 
paving the way for its further biochemical characteriza¬ 
tion. Using a two-step procedure, the protein was purified 
to near homogeneity, as confirmed by SDS PAGE and 
non-denaturing gel filtration. We found that even a 
single-step purification scheme could produce an enzyme 
preparation that combined specific activity with reason¬ 
able purity (>80-85%). Of particular importance in our 
expression protocol was the co-expression of the Ubpl 
protease that mediated the in vivo removal of the 
N-terminal Ub tag (Figure 1A) (26). Apparently, similar 
to PV 3D po1 (30), SARS-CoV RdRp does not tolerate 
additional amino acids at its N-terminus, likely because 
they induce an unstable and largely inactive fold. Because 
of the observed sensitivity to N-terminal fusions, it is 
tempting to speculate that the processing of the pplab 
cleavage generating the nspl2 N-terminus is of great 
importance for proper folding and activity in vivo as 
well. In line with this notion, it was reported that for the 
related group 2 CoV mouse hepatitis virus (MHV) this 
cleavage site indeed is indispensable for virus replication 
(44). It still remains to be tested whether there is any effect 
of the C-terminal His-tag on the activity of the enzyme 
used in this study. 

The SARS-CoV nspl2 is a relatively large protein 
(932-amino-acid residues), e.g. much larger than the well 
described RdRps of HCV (591 amino acids) and PV (461 
amino acids), and may have, in addition to the C-terminal 


RdRp domain, other functional or contributing domains 
in its still uncharacterized N-terminal domain(s). 
Previously, the PV and HCV RdRps were demonstrated 
to bind RNA with K d values of 2-10 pM (45) and ~3 pM 
(46), respectively. Our data suggest that SARS-CoV nspl2 
binds RNA with slightly higher affinity for both dsRNA 
and ssRNA (AT d ~ 0.1 pM). The relatively similar affinities 
for dsRNA and ssRNA suggest that the lack of de novo 
initiation may largely be due to a relatively open structure 
and the absence of a conserved priming loop in the pre¬ 
dicted thumb subdomain (23). The latter structural 
element, a (3-hairpin in, e.g. hepatitis C virus (HCV) 
NS5B, is present in many known de novo initiating 
RNA polymerases. Canonically, it functions by partly 
obstructing the template binding cleft and is believed to 
be essential for association with the single-stranded 3' end 
of the template RNA (23,47,48). 

Contrary to previous GST-pull downs performed for 
nspl2 (49) and data obtained for other RdRps such as 
the PV 3D po1 —which was reported to form multimers 
and even lattices (45)—we were not able to establish 
nspl2 multimerization using non-denaturing gel filtration. 
Although this suggests that SARS-CoV RdRp exists 
predominantly as monomer under the conditions used 
here, it is likely that nspl2 binds to various other viral 
replicase subunits (50) and cellular cofactors (51) in vivo. 
The CoV holo-enzyme complex formed in this manner 
may have more finely tuned characteristics than docu¬ 
mented in this study. In addition, it remains to be tested 
whether the C-terminal hexahistidine tag interferes with 
multimerization. Clearly, these and other questions 
should be addressed in more detail in future studies. 

Upon binding to template and primer, we found nspl2 
to have reproducible nucleotide incorporation rates on 
the templates tested, but incorporation depended signi¬ 
ficantly on the length of the template. Furthermore, on 
heteromeric templates the SARS-CoV RdRp also per¬ 
formed better and reached incorporation rates in the 
same order (>nM/min) as the model polymerases from 
PV and HCV, when correcting for enzyme and template 
concentrations (40,52). In addition, the SARS-CoV nspl2 
is also active on a DNA template, which may greatly 
facilitate further investigations in view of the superior 
stability of DNA templates and the fact that various 
other SARS-CoV enzymes (17,33,53) have documented 
affinity for DNA as well. It remains to be tested whether 
SARS-CoV nspl2 is also active on a longer single- 
stranded template, e.g. RNA templates based on the 
terminal sequences of the viral genome, and whether it is 
able to traverse the RNA secondary structures present in 
such a molecule. Taken together, our data suggest that our 
present purification scheme and assay constitute a 
concrete starting point for further biochemical exploration 
of the SARS-CoV RdRp and nspl2 targeted inhibitor 
development. 
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